Compositional Mediation Analysis for Microbiome Studies
نویسندگان
چکیده
Motivated by recent advances in causal inference on mediation analysis and problems in the analysis of metagenomic data, we consider the effect of a treatment on an outcome transmitted through microbes, or compositional mediators. Compositional and high dimensional natures of such mediators make the standard mediation analysis not directly applicable. In this paper, we propose a method for estimating the expected causal direct and indirect (or mediation) effects for a sparse high-dimensional compositional mediation model utilizing the algebra for compositions in the simplex space and the nature of compositional data. The test of the total and component-wise mediation effects are also proposed. We conduct extensive simulation studies to assess the performance of the proposed method and apply the method to a real metagenomic dataset to investigate the effect of fat intake on body mass index (BMI) mediated through the gut microbiome composition. . CC-BY 4.0 International license peer-reviewed) is the author/funder. It is made available under a The copyright holder for this preprint (which was not . http://dx.doi.org/10.1101/149419 doi: bioRxiv preprint first posted online Jun. 13, 2017;
منابع مشابه
Microbiome Datasets Are Compositional: And This Is Not Optional
Datasets collected by high-throughput sequencing (HTS) of 16S rRNA gene amplimers, metagenomes or metatranscriptomes are commonplace and being used to study human disease states, ecological differences between sites, and the built environment. There is increasing awareness that microbiome datasets generated by HTS are compositional because they have an arbitrary total imposed by the instrument....
متن کاملNext Generation Sequencing and its Application in the Study of Microbiome in Plant Diseases Suppressive Soils
Progress in next-generation sequencing has played a significant role in ecological studies of microbial populations. These advances have led to a rapid evaluation in metagenomics studies (analysis of DNA of microbial communities without the need to culture). Many statistical and computational tools and metagenomics databases have led to the discovery of huge amounts of data. In this research, i...
متن کاملGenome analysis Temporal probabilistic modeling of bacterial compositions derived from 16S rRNA sequencing
Motivation: The number of microbial and metagenomic studies has increased drastically due to advancements in next-generation sequencing-based measurement techniques. Statistical analysis and the validity of conclusions drawn from (time series) 16S rRNA and other metagenomic sequencing data is hampered by the presence of significant amount of noise and missing data (sampling zeros). Accounting u...
متن کاملAnalysis of High Dimensional Compositional Data Containing Structural Zeros with Applications to Microbiome Data
This paper is motivated by the recent interest in the analysis of high dimensional microbiome data. A key feature of this data is the presence of ‘structural zeros’ which are microbes missing from an observation vector due to an underlying biological process and not due to error in measurement. Typical notions of missingness are insufficient to model these structural zeros. We define a general ...
متن کاملThe Use of Robust Factor Analysis of Compositional Geochemical Data for the Recognition of the Target Area in Khusf 1:100000 Sheet, South Khorasan, Iran
The closed nature of geochemical data has been proven in many studies. Compositional data have special properties that mean that standard statistical methods cannot be used to analyse them. These data imply a particular geometry called Aitchison geometry in the simplex space. For analysis, the dataset must first be opened by the various transformations provided. One of the most popular of the a...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2017